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On the structure of a family of probability 
generating functions induced by 
shock models 

Satrajit Roychoudhury* ^ and Manish C. Bhattacharjee^ 

New Jersey Institute of Technology 

Abstract: We explore conditions for a class of functions defined via an in- 
tegral representation to be a probability generating function of some positive 
integer valued random variable. Interest in and research on this question is 
motivated by an apparently surprising connection between a family of clas- 
sic shock models due to Esary et. al. (1973) and the negatively aging non- 
parametric notion of "strongly decreasing failure rate" (SDFR) introduced 
by Bhattacharjee (2005). A counterexample shows that there exist probabil- 
ity generating functions with our integral representation which are not discrete 
SDFR, but when used as shock resistance probabilities can give rise to a SDFR 
survival distribution in continuous time. 

1. The Problem 

A surprising connection between a family of classic shock models (Esary et al. 
[2]) and the negatively aging nonparametric notion of strong decreasing failure rate 
(SDFR) (Bhattacharjee [1]) is exploited in this paper to investigate necessary and 
sufficient conditions for a class of functions defined via an integral representation to 
be a probability generating function (p.g.f ) of some positive integer valued random 
variable (r.v.). The question we investigate can be simply posed as follows. If Q is a 
probability measure on the half line, under what conditions is the function defined 

by 



(1-1) / , ^1 Qidy) 

Jo I- z + yz 



< z < 1, 

yz 

a p.g.f. of some positive integer valued r.v. N ? This problem arose in the context of 
investigating a strong negatively aging property, introduced and studied by one of 
us (Bhattacharjee [1]), and its preservation by a class of shock models. Wc show that 
a weak converse to the preservation result also holds and provide a counter example 
to illustrate the breakdown of the full converse. Our work can thus be viewed as a 
non-traditional application of reliability ideas in the sense that the question posed 
can be conceived independently of the reliability theoretic framework, while ideas 
from the latter proves fruitful for the solution. The only other work we are aware of 
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that is analogous in spirit, though entirely different from our work presented here, 
in terms of the specific question posed and results obtained, is a paper by Pakes 
[()] on the structure of a class of p.g.f.s and its connections to Markov branching 
processes. 

For any y in (0,1), recognizing the integrand to be the p.g.f. of a geometric 
distribution over the positive integers, the answer is clearly affirmative if the support 
of the mixing distribution Q is no larger than (0,1]. The case y = 1 corresponds 
to a mixing distribution degenerate at 1. For y G (0,1], we can think of N as 
conditionally geometric given y, so that (1.1) is the unconditional p.g.f of N, when 
the parameter y is randomized over (0, 1]. In other words, if Q(0, 1] — 1, then the 
function defined by (1.1) is a Bayesian's view of the p.g.f of N when y has a prior 
Q. In fact if {Xi, X2, . . .} is a sequence of binary exchangeable random variables, 
then for any integers n, k such that I < k < n, any {ii, i2, ■ ■ ■ , ik} C {1, 2, . . . , n}, 
by de Finetti's classic result, 

P{X,, ^X,, = ---^ X,, = 1, X^ - for J e {1, ... ,71} \ . . .,ik}} 

Jo 

for some probability measure Q in (0,1]; it easily follows that the random variable 

N ^ inf{?i > 1 : X„ = 1} 



indeed has the p.g.f. 



Q{dy) 



l~ z + yz 

for some unique probability measure Q supported by the unit interval. 



However, if the support of Q on K+ = [0, c») extends beyond [0,1], then the 
answer to our question is not obvious. Hence, we ask if Q(l,oo) > , can (1.1) 
still be a p.g.f. of a random variable A'^? The next section explores and answers this 
question. 



2. Motivation and main results 



Our problem is motivated by the following observations and results. Let 
(2.1) ~S{t)=±P,e-~^^'^^ 

k=0 

be the survival probability of the standard Esary-Marshall-Proschan (EMP) shock 
model (Esary et al. [2]), where failure is caused by shocks arising over time according 
a homogeneous Poisson process {N{t);t > 0} with intensity A > and the number 
of shocks(J) to failure has tail (shock resistance) probabilities Pk = P{J > k), k = 
0, 1, 2, . . . . Esary et al. [2] proved that all of the standard nonparametric positive 
and negative aging properties of J in discrete time are preserved by the survival 
distribution S = 1 ~ S in continuous time. 

Definition 2.1. (i) A non-discrete lifetime X with distribution function F has the 
SDFR(Strongly Decreasing Failure Rate) property if the tail (reliability) function 
F{t) ~ P{X > t) is a completely monotone function (Feller [3, 4]) on [0, 00). 
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(ii) A discrete non-negative integer valued random variable X is discrete SDFR 
if its tail probability {it„ = P{X > n) : n = 0, 1, 2, . . .}, with uq = 1 is a completely 
monotone sequence (Feller [•], 4]). 

A positive integer valued random variable X has the discrete SDFR property if 
the non- negative random variable {X — 1) is discrete SDFR. 

Lemma 2.1. The Laplace- Stieltjes transform of S in (2.1) is given by 

L{s)^(f>{X/{X + s)), s>0, 

where is the p.g.f of the random number J of shocks to failure. 
Proof. Follows by routine computations; viz., for s > 0, 

i<co _ °° _ \^ 

s-'{l-L{s)}= e-^'Sit)dt = ^A--- using (2.1) 

^0 + 

A '■ 



X + s '"\X + s 

k=0 ^ 



Now use the standard identity, 

gp,,^ = iz:M with . = ^e(o,i). 

^-^ 1 — z X + s 



□ 



A;=0 



The following characterization of the non-discrete SDFR property is known from 
Bhattacharjee [1]. 

Theorem 2.1. X ^ F is SDFR iff it has a representation 

F{t)= / {1 - e-^')G{dX) 



for some unique mixing distribution G continuous at zero or, equivalently 

X IS SDFR ^X 

where denotes equality in distribution, Y and Z are independent, Y is exponen- 
tially distributed with unit mean and Z is a positive random variable. 

Sketch of proof. By Bernstein's representation theorem of completely monotone 
functions (Feller [4]), there exists a unique, possibly substochastic, measure G on 
[0, oo) such that for t > 0, 



F{t) = / e-^*G{dX) = G{0} + / e-'*G(dA). 

We must have G{0} = 0, since F{t) ^ as i 0. □ 

If the shock resistance probabilities Pk in (2.1) are discrete SDFR, does the 
shock model survival probability inherit the SDFR property in continuous time? 
The answer is affirmative. 

Theorem 2.2. If Pk is SDFR, then S is SDFR. 
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Proof. Using Hausdorff's result (Feller [4], see Theorem 2, pages 222-224) on the 
equivalence of moment sequences of distributions on the unit interval [0, 1] and the 
complete monotonicity property of such sequences, 



for some probability measure F on the unit interval. Accordingly from (2.1), we 
have. 



for some probability measure H. Note, A > implies that the support of H is 
contained in [0, oo), even though F has no mass outside the unit interval. Theorem 



A necessary and sufficient condition for the EMP shock model survival distribu- 
tion S to be SDFR is given by the next result, that sets in perspective our question 
posed in Section 1. 

Theorem 2.3. The EMP Shock Model distribution function S is SDFR iff the 
number of shocks to failure has a probability generating function (j){z) — Ez"^ with 
a unique representation 



Pk is SDFR with Po = 1 ^ Pk = Jo p'' F{dp), 




2.1 now imphes that 5* is SDFR. 



□ 



(2.2) 




for some mixing distribution Q with support in (0, cxd). 
Proof. By Lemma 2.1, for s > 0, 




E{e-''^), where T has tail S{t) in (2.1) 






G{d9) (using Fubini's Theorem). 



Setting z 



A/(A + s) e (0, 1) as s e (0, oo), A > 0; this yields 




/ 

Jo 



■oo 



0A-1 



G{de), since G{0} = 



0A-i + (i-l) 




zY 
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where 9 - G and f = F - Q. 

Note that, since G is continuous at zero by Theorem 2.1, so must be Q. Hence 
Q{0) = 0. Also, since A > 0, clearly the mixing distribution Q of the mixing variable 
Y must have support contained in (0, oo). □ 

At this point an obvious question is: what more can we say about Q? In particu- 
lar, what should be the support of Q7 Are there necessary and sufficient conditions 
on Q such that the right hand side of (1.1) is always a probability generating func- 
tion (p.g.f.)? To explore such conditions, a nontrivial observation is that Q cannot 
have zero mass in (0, 1]. To check this claim, rewrite (2.2) as 

/>oo 

0(z) = / H{y,z)Q{dy), 
Jo 

where the integrand 

H{y,z)^- 2/>0, 0<z<l, 

1 — z + zy 

is concave t in y > 0, for each z G (0, 1]. Thus, since 



inf i7(?/,z) = i7(0+,z) = 0, and ini H{y, z) ^ H{1, z) 

0<y<l y>l 



we have, 



/•OC 

,^(z)>/ inf i/(2/,z)Q(d2/) = zQ(l,^), 
Ji y>i 

so that, Q(l,oo) < z~^(/)(z), for < z < 1; whence, it follows that 
g(l, oo) < lim ^ = 0'(O) = pi= P(J = 1), 

z-fO+ z 

where J is the number of shocks to failure in the EMP shock model (2.1), with tail 
probabilities Pk- This must imply Q(0, 1] > 0; for if not, then 1 > pi > Q(l, cxd) = 1. 
Thus J = 1 w.p. 1 and hence (f>{z) = E{z'-') = z, which in turn, in virtue of (2.2) 
implies Q{1} = 1 and contradicts the hypothesis Q(0, 1] = 0. 

Similarly, using the monotonicity in y of the integrand in (2.2), and by noting 
suPo<y<i H{y, z) ^ H{\, z) = z, and supj^^;^ H{y, z) = liniy^oo H{v-, z) = 1, we see 
that 0(z) defined in (2.2) satisfies 

(t){z) < zQ(0,l] + Q(l,c»). 

Hence if the entire mass of Q lies in the unit interval (0, 1], then (j>{z) < z, so that 
the graph of the p.g.f. (/"(z) lies entirely below the diagonal in the unit square. In 
other words, the number (J) of shocks to failure dominates the degenerate random 
variable constant at 1 in the generating function order (Miiller and Stoyan [-5] ) . 

These observations still beg the question: can Q allocate positive mass to (1, oo), 
while 0(z) in (2.2) still remains a p.g.f.? In attempting to answer this question, the 
following lemma will prove useful. 

Lemma 2.2. For any non-negative integer valued random variable N with distri- 
bution, Qn = P{N = n); n = 0, 1, 2, • • ■ we have 

E{l~zf = Y,Ckz'', where Ck = i-lfY.(l)^^^ < z < 1. 
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Proof. Simply note, 

oo 



OO 71 

' n 



oo oo 



k—O n—k 

This leads to the following necessary condition. 

Theorem 2.4. // the integral in (1.1) is a probability generating function of a 
positive integer valued random variable N, then Q cannot have support beyond 
(0,2). 

Proof. By Theorem 2.3, if the integral in (1.1) is to be a p.g.f., then it must corre- 
spond to the p.g.f. ipiz) of the number of shocks to failure in the EMP shock model 
(2.1). Hence, 

0(2) = ^( i„^ + ^y ) ' whereO<y^Q 



^y{z)Q + {l-Vi(l-2)}Q(d2/), 



where ipyi^) = Ez '« , Ny ~ Geometric{y) , < y < 1 with mass function P{Ny 



k) =y{l-yf \ k> 1. Thus 



(2.3) (biz) = 



Ez''yQ{dy)+ {1- £{1 - z) ^}Q{dy) 



for < z < 1. Hence, for the EMP Shock Model, the generating function of the 
shock resistance probabilities Pk = P{J > k), must be 

M(z)^Vp,2^ = i-M, 0<z<l 

^ — ^ — r 



fc=0 



(2.4) = / , Qidy) + / E{l~zfyQ{dy), 

Jo ^ ^ Ji 

where N* = iVi - 1, y > 1. 

J y 

The integrand in the first term of (2.4) is easily seen to be {1 — z(l — y)}^^ = 
^^q(1 — yYz'^ for < y < 1 and < z < 1. To evaluate the second term in (2.4), 
use Lemma 2.2, to get 

g„ = P{N; ^ n) - P{N~ = n + 1) = n(l - H)". 
where H = < 1 and rt = 0, 1, 2, . . . . The coefficients c^, defined in Lemma 2.2, 
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in the expansion of E{1 — z)^y , are 

c. = (-ifnf;(^ + ^')(i-nr^ 

= (-1)''— ^^P((fc + l)st success in(A: + j + l)st trial) 

(2.5) = =(l-2/)^ 

since n = 1/^^ < 1. Note, the right hand side of (2.5) is positive for k even and 
negative for k odd. Thus (2.4) can be rewritten as, 



k=0 •'^ U=0 



(2.6) y-Pkz'^ I {y:{l-yfz'}Q{dy) 



/oo ( oo "j 



In the first term of the right hand side of (2.6), the series is absolutely convergent. 
Hence the integral and the summation can be interchanged. But the series in the 
second term converges iff y < 2. This implies, if Q[2;CX3) > then the right hand 
side of (2.6) diverges. Thus, Q[2,oo) = 0. □ 

Theorem 2.4 allows us to have a representation of the shock resistance probabil- 
ities Pfc via the mixing distribution Q. From (2.6), note that, for all z g (0,1), 



oo oo ( /•2 

(2.7) ^Pfcz^- = E / 



z\ 



where 

(2.8) a(j/) = l-y: 



(1 - y) > 0, for < y < 1, 
-(y-l)<0, for 1< y < 2. 



Equating coefficients of z^ on both sides of (2.7), for /c = 0, 1, 2, ... we have 

(2.9) Pk = f a\y)Q{dy), fc = 0,1,2,.... 

Jo 

Theorem 2.4 implies that for the function (f){z) defined by (1.1) to be a p.g.f., 
we must have Q[2,oo)=0. Is this the sharpest possible result? Or is there a sharper 
necessary condition? We do not know. A counterexample at the end of this section 
exhibits a mixing distribution Q with support strictly larger than the unit interval, 
but the right end point of support also falls strictly short of 2. The counterexample 
does establish however that, as expected, the full converse to Theorem 2.4 is false. 

For what conditions on the mixing distribution Q, is a tail probability of a 
discrete non-negative random variable (not necessarily discrete SDFR)? In other 
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words, what conditions on Q would ensure that Pk is non- negative and monoton- 
icaUy decreasing? Note, we aheady have Pk directly from (2.9). Interestingly, 
a weak converse to Theorem 2.4 does hold under a condition which turns out to be 
directly relevant to our original question posed in Section 1. 

Theorem 2.5. Suppose that the shock model distribution S in (2.1) is SDFR. Then 
the corresponding number of shocks to failure has the discrete SDFR property if and 
only if the support of the mixing distribution Q in the representation of its p.g.f in 
(2.2) is contained in (0, 1]. 

Proof. Given that the shock model probability S in (2.1) is nondiscrete SDFR, we 
have the representation in (2.9) of the shock resistance probabilities, which can be 
expressed as 

(2.10) Pk = ^{(1 - y)'= l{o<F<i} + (-l)'(^ - 1)' l{i<Y<2}} = 

where V = I ~Y and Y ^ Q with support in [0,2). Thus -1 < ^ < 1, w.p. 1, 
with d.f. F, given by 

F{v) 1 - Q{l~v-0), for -1< w < 1. 

Accordingly, Pk is a moment sequence 

(2.11) Pk= [ v'^Fidv), fc = 0,1,2,..., 

J{\v\<l} 

of a distribution F with support in (—1, 1). On the other hand, by the classic result 
of Hausdorff (Feller [ ]). Pk is discrete SDFR (or, a completely monotone sequence 
satisfying Pq = 1) <==^ Pk is the moment sequence of a distribution with support in 
[0, 1]. Since d.f.s with bounded support are uniquely determined by their moment 
sequence, this implies V must have support in [0, 1]. Hence, 

= p{-i < y < 0) = F(i< y < 2). 

In conjunction with Theorem 2.4, this implies that Q must be supported by the 
unit interval. 

Conversely, suppose Q(0, 1] = 1. Then the representation (2.9) of the shock 
resistance probabilities reduces to 

Pfc= / (1-2/)' Q{dy), fc = 0,1,2,.... 

JQ 

Thus Pk constitutes the moment sequence of a unique probability measure in the 
unit interval. Again, in virtue of Hausdoff 's theorem, as remarked in the proof of 
Theorem 2.2, this is equivalent to Pk being discrete SDFR. □ 

Contrary to crude intuition, the mixing distribution Q can have positive mass 
in (1,2) and the corresponding (j){z) can still be a p.g.f. as Theorem 2.4 suggests 
and the following counterexample demonstrates. In view of Theorem 2.5 however, 
it can no longer have the discrete SDFR property. 

Counterexample . We construct a mixing distribution Q on the half line such 
that (2.9) yields a tail probability sequence. Note that in view of (2.10), it is enough 
to find a distribution of a random variable V with support in (—1,1) such that 
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its moment sequence EV'' is monotonically decreasing in k = 0,1,2,..., which 
requires 

EV^" > > EV^"+^, n = 0, 1,2,.. ., where - 1 < < 1 w.p. 1. 

Only the second incquahty needs to be verified for a counterexample, since the first 
one is free. Setting U = -V, we see that EV^"^ > E'F^n+i 

E{l{o<u<i}U'^" + l{o<y<i}^^"} 

> E{- l{o<c/<i}C/'"+' + l{o<v<i}V''"+'} 
^^{l{o<c/<i}C/'"(l + C/)} 

(2.12) >-i?{l{o<y<i}^'"(l~^)}, 

which clearly holds, since the left hand side is positive, while the right hand side is 
non-positive. Now set V to have a density given by 

(2.13) f{v) = ^l{_„<„<oj + (1 - /3)l{o<„<i}, 

with discontinuities at w = —a and 0. The corresponding d.f. with parameters 
a > 0, /3 > 0, is 



F{v) 



0, if w < —a, 

/3(1 + wa-i), if -a<v <0, 

P)v, if < i; < 1, 

1, if V > 1. 



We need to verify that there exist a choice of (a, /3) G (0, 1)^ such that ijy^^+i > 
-gy2n+2 gyery n > 0. Computations analogous to those leading to (2.12) shows 
that this is equivalent to demanding 



+ ^^»+^) f{-v)dv < / (w^"+^ - z;^"+^) f{v)dv. 
Jo 

For the density in (2.13), the left hand side equals 

^ /"(.--^ + .-+^) dv = 2n(l + .) + (3 + 2a) 

aio J (2?i + 2)(2n + 3) 

while the right hand side equals (1 — /3){{2n + 2)(2n + 3)}^^. Thus we need to show 
that it is possible to have, 

(2.14) /3a2"+i{2n(l + a) + (3 + 2a)}< (1-/3), n = 0,1,2,.... 
If we choose (a,/?) such that 

then (2.14) would be assured under (2.15), provided 

(2.16) 2na^"{l + a) < (1 - a'"), n>l. 

ap 
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This is indeed feasible for a wide range of (a,/3) £ (0, 1)^. For example take 1/3 < 
/3 < 1, and choose a > such that a(3 + 2a) < 2, thus satisfying (2.15). In 
particular, choosing < a < 2/7 is enough, since this would imply 

o 

< a < 2/7 < 



3 + 2q; 



For such a choice of a, the right hand side of (2.16) is bounded below by {7(1 — 
(2/7))^"}, whereas the left hand side is bounded above by 

2n(2/7'")(9/7) < 3n(2/7)'" < 3n(l/3)'" = nS^'^n-i) ^ ^ ^ i^^^ _ (2/7)'")}, 

the lower bound to the right hand side of (2.16). This establishes our claim that 
the class of d.f.s with support in (—1, 1) whose moment sequence defines a discrete 
tail probability is a nonempty class. 

Finally, to get a counterexample of a mixing distribution Q with support in [0, 2) 
such that (5(1, 2) > 0, take Q to be the d.f. of F = 1 — V where V has density as in 
(2.13). The corresponding continuous d.f. is Q{y) ~ P{V > 1 — y) = 1 — F{1 — y), 
for < y < 1 + a. Thus; 



Qiy) 



0, if y< 0, 
{i-P)y, ifO<y<l, 
(l-/3)+a-i/3(y-l), if l<y<(l + a), 

1, if y > 1 + a. 



Using this in (2.9) leads to, 

and defines a tail probability for < a < 2/7 and f3 > 1/3, as we have argued. 
Note, the support of Q exceeds the unit interval. In particular, choosing (3 = 2/3 
and a = 1/7, we have A^Pi = —0.00523. Thus Pk is not a completely monotone 
sequence, and hence is not SDFR. 

In closing, we note that when the integral in (2.2) is indeed a p.g.f., it can 
be bounded below (and above, when EY < 1) by simple geometric p.g.f.s. The 
Laplace-Stieltjes transform of the EMP shock model distribution S in (2.1), can be 
correspondingly bounded by Laplace transforms of suitable exponential distribu- 
tions. In stochastic comparison parlance, these findings can be stated in terms of 
stochastic domination via the "generating function order'^ <g and "Laplace trans- 
form order'' <l (for definitions, see Miiller and Stoyan [5]) 

Let Na be a geometric r.v. on the positive integers, with mass function P{Na = 
k) — a(l — a) fc > 1, as defined in proof of Theorem 2.4, and let (exp)^ denote 
an exponential distribution with mean fi. 

Theorem 2.6. For the EMP shock model distribution S in (2.1) and the corre- 
sponding number of shocks (J) to failure, we have 

J <g ^(EJ)-' '^"^ ^ <L {exp)f,=x-^Ej 
If EY < 1, then, N(^ey) <g J o.nd {£xp)u=(\EY)-^ <L S. 



The bounds are sharp, with equality attained when J — 1 w.p. 1. 
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Proof. Note that the expected number of shocks to failure, satisfies 

1 - 0(z) 



1< EJ = hm 

2^1- 

(2.17) = hm 



1- 1 - z 

Q{dy) _ rQ{dy) 



1- 



1-z + zy Jo y 



by the dominated convergence theorem, since the integrand is | in y, for each 
z G (0,1) and is thus bounded above by the constant (1 — z)~^ . We note in 
passing that, using the necessary condition in Theorem 2.4, we could argue : 
EJ = y^^Q{dy) > ^, which however is superseded by the sharper bound 
EJ > 1, since J > 1, w.p. 1. In fact (2.17) together with Theorem 2.4 now implies 
Jo y~^Q{dy) > 1, which implicitly restricts the amount of mass Q can allocate to 
the interval (1,2). 

Finally, appealing again to the concavity in y, and hence convexity in x = y~^ , 
of the integrand in (2.2), for each z € (0, 1); standard Jensen's inequalities give, 



1- z + z{EY) - ' + (1 - zW^^ J - z+{l- z){EJ) ' 

using (2.17). We note that while the upper bound is always valid, it is a geometric 
p.g.f. only when EY < 1. The lower bound is of course the p.g.f. of a geometric 
distribution with parameter {EJ) ^ . 

The Laplace ordering claims now follow from (2.18), by using Lemma 2.1 and the 
substitution z = A/(A + s). All bounds are easily seen to be sharp, being attained 
when the first shock causes failure (when J = 1, w.p. 1). □ 

Acknowledgment. It is a pleasure to dedicate this paper in honor of Professor P. 
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